Joint Reference and Relation Extraction from Legal Documents with Enhanced Decoder Input
نویسندگان
چکیده
Abstract This paper deals with an important task in legal text processing, namely reference and relation extraction from documents, which includes two subtasks: 1) extraction; 2) determination. Motivated by the fact that subtasks are related share common information, we propose a joint learning model solves simultaneously both subtasks. Our employs Transformer-based encoder-decoder architecture non-autoregressive decoding allows relaxing sequentiality of traditional seq2seq models extracting references relations one inference step. We also method to enrich decoder input learnable meaningful information therefore, improve accuracy. Experimental results on dataset consisting 5031 documents Vietnamese 61,446 show our proposed performs better than several strong baselines achieves F 1 score 99.4% for task.
منابع مشابه
Temporal information extraction from legal documents
The aim of this paper is to analyze what kinds of temporal information can be found in different types of legal documents. In particular, it provides a comparison of different legal document types (case law, statute or transactional document) and how one can do further reasoning with the extracted temporal information.
متن کاملUnsupervised Relation Extraction From Web Documents
The IDEX system is a prototype of an interactive dynamic Information Extraction (IE) system. A user of the system expresses an information request for a topic description which is used for an initial search in order to retrieve a relevant set of documents. On basis of this set of documents unsupervised relation extraction and clustering is done by the system. The results of these operations can...
متن کاملReference Line Extraction from Form Documents with Complicated Backgrounds
Form document analysis is one of the most essential tasks in document analysis and recognition. One of the most fundamental and crucial tasks is the extraction of the reference lines which are contained in almost all form documents. This paper presents an efficient methodology for the complicated grey-level form image processing. We construct a non-orthogonal wavelet with adjustable rectangle s...
متن کاملCombining NLP Approaches for Rule Extraction from Legal Documents
Legal texts express conditions in natural language describing what is permitted, forbidden or mandatory in the context they regulate. Despite the numerous approaches tackling the problem of moving from a natural language legal text to the respective set of machine-readable conditions, results are still unsatisfiable and it remains a major open challenge. In this paper, we propose a preliminary ...
متن کاملCatch Phrase Extraction from Legal Documents Using Deep Neural Network
This paper is based on finding and extracting important key phrases (catchphrase) from a document from which the the document can be summarized. This is important as this will reduce time consumption in summarization of documents. This work is realizedwith the help of deep neural network to train anmodel for recognizing such important key phrases based on various calculated parameters.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Cybernetics and Information Technologies
سال: 2023
ISSN: ['1311-9702', '1314-4081']
DOI: https://doi.org/10.2478/cait-2023-0014